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To serve as a dispersion relation, a cotangent bundle function must satisfy three simple al- 
gebraic properties. These conditions are derived from the inescapable physical requirements 
to have predictive matter field dynamics and an observer-independent notion of positive 
energy. Possible modifications of the standard relativistic dispersion relation are thereby 
severely restricted. For instance, the dispersion relations associated with popular deforma- 
tions of Maxwell theory by Gambini-Pullin or Myers- Pospelov are not admissible. 

INTRODUCTION 

In the standard description of relativistic matter, the dispersion relation that governs the be- 
haviour of matter is given in terms of a Lorentzian metric. In recent years, however, numerous 
authors have made a case for various modified dispersion relations pJEj, mostly motivated from 
specific approaches to quantum gravity [8lFi8| or other particular physical and mathematical models 

[MSI- 

In this paper, we address the question of what can be said about dispersion relations in general, 
and most importantly how they are restricted in principle, independent of their physical moti- 
vation. Starting only from the fundamental physical requirements that the underlying theory be 
predictive and that there be a well-defined notion of positive energy, we show that there are three 
inescapable algebraic conditions that a modified dispersion relation in classical physics must satisfy. 
This severely restricts possible modifications to the standard relativistic dispersion relation. More 
precisely, the central result we arrive at in this article is that 

A dispersion relation must he given by a cotangent bundle function P that is 
a reduced, hi-hyperbolic and energy distinguishing homogeneous polynomial in each fibre. 

These conditions, whose technical definition and justification will be the subject matter of the first 
four sections of this article, all essentially root in the application of well-known results from the 
theory of partial differential equations, real algebraic geometry and convex analysis to the kind of 
questions one considers in classical physics. 
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Remarkably, the above conditions are not only physically necessary, but also mathematically 
sufficient to set up the entire kinematical machinery one needs in order to give physical meaning 
to quantities on spacetimes with such dispersion relations. The central role is played here by 
Gauss and Legendre maps, which provide a proper duality theory between cotangent and tangent 
spaces. Their existence is far from trivial once one leaves the Lorentzian metric framework, and 
indeed we will see that one requires some rather sophisticated real algebraic geometry that was 
originally conceived in the solution of Hibert's seventeenth problem. Once these theoretical issues 
are clarified, though, one sees that indeed everything that one wants from a Lorentzian metric 
in standard relativity is equally well afforded by a cotangent bundle function with the said three 
properties. And, this is maybe the most important point, only by such. Thus the present work 
defines the outer boundaries of what constitute physically viable classical spacetime structures. 

To arrive at the above results requires numerous steps and occasional asides on mathematical 
techniques. The generality of the obtained result, however, makes this worth the effort. In order to 
provide the reader with an intuition for how the argument proceeds, we now briefly outline what 
is shown in each section. 

In section [TJ we study how linear matter field dynamics on an arbitrary tensorial geometry give 
rise to a massless dispersion relation. This will clarify two important points. On the one hand, we 
will see that any massless dispersion relation must be described by a cotangent bundle function 
P that induces a reduced homogeneous polynomial in each cotangent space. This is the first of 
the three algebraic conditions identified in this work. On the other hand, it reveals the close 
link between dispersion relations and the underlying spacetime geometry seen by specific fields. 
Thus the restrictions on dispersion relations derived in the course of the paper directly translate 
into restrictions of the underlying geometry seen by fields. In section [2j we will explain that a 
necessary condition for the matter field equations to be predictive is that the cotangent bundle 
function P be hyperbolic in each cotangent space. Here we will make extensive use of the theory of 
hyperbolic polynomials and prove two important technical lemmas. One of these will ensure that 
for the set of roots of reduced hyperbolic polynomials a real version of Hilbert's Nullstellensatz 
holds, which will be of great technical importance later. In section [3j our focus changes from 
cotangent space to the geometry which a massless dispersion relation induces on tangent space. In 
particular, we associate vector duals with massless momenta. A central role is played here by what 
algebraic geometers call the dual polynomials with respect to those induced by P. The physical 
significance is that the dual polynomials emerge as the tangent space geometry seen by massless 
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point particles, but not massive point particles as we will see later. In section |4j we will see that 
not only the polynomials defined by P, but also their dual polynomials, must be hyperbolic. 
This is our second algebraic condition on dispersion relations. In conjunction with the third and 
final algebraic condition, namely that the polynomials be energy-distinguishing, this is necessary 
in order to have a well-defined notion of positive particle energy on which all observers agree. 
Thus at the end of this section, we identified all three algebraic conditions for a cotangent bundle 
function P to serve as a physically viable massless dispersion relation. In section [5j we extend the 
theory to massive dispersion relations. Bi-hyperbolicity and the energy-distinguishing property, 
previously recognized as essential properties in the context of massless dispersion relations, is 
shown to also play a crucial role when discussing massive matter. In particular, they imply an 
reverse triangle and an inverse Cauchy-Schwarz inequality and, above all, ensure the existence of 
the Legendre duality theory for massive covectors employed in the succeeding section. In section 
[6| the action for free massive point particles is analyzed. Compared to the dual tangent space 
geometry seen by massless particles, the dual geometry seen by massive particles is encoded in 
another, generically non-polynomial, tangent bundle function. The generic emergence of two 
different tangent bundle geometries is an important point, since it obstructs any attempt to devise 
a Finslerian extension of Lorentzian geometry which would be able to capture the massive and 
massless particle behaviour in one single tangent bundle structure. In section [7j we show why the 
decidedly covariant discussion of dispersion relations, which we adopted throughout this work, is 
required even if one ultimately prefers to represent the dispersion relation in form of a function 
E(p), expressing the energy of a particle in terms of its spatial momentum. This will be seen 
to be the case because the required temporal-spatial split of a covariant particle momentum is 
only defined in terms of the covariant dispersion relation. Pushing the theory of observers and 
frames further, we identify the generically non-linear parallel transport induced by a bi-hyperbolic 
and energy-distinguishing dispersion relation and thus succeed in defining inertial laboratories. 
In section [8j we put our understanding of massive and massless momenta to work. By using 
virtually the entire machinery developed before, we reveal a generic high-energy effect for general 
dispersion relations. In particular, we determine the maximum energy a massive particle can 
have without radiating off, sooner or later, a massless particle in non-Lorentzian geometries. The 
geometric picture reveals that this is a fully covariant feature induced by the spacetime geometry. 
In section [9j we illustrate in detail how easy it is to check whether concrete field equations 
possess a bi-hyperbolic and energy-distinguishing dispersion relation. Remarkably, the popular 
deformations of Maxwell theory by Gambini-Pullin or Myers-Pospelov do not pass this test and 
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are thus recognized to be non-predictive and to obstruct a well-defined notion of positive energy. 



Finally, in section 10 we draw conclusions from what has been learnt, indicate limitations of the 
results we obtained, and point out remarkably interesting issues that one may now study based 
on the results of the present work. 



Throughout the paper, the abstract theory is illustrated by showing how the constructions work 
out for the familiar example of metric geometry on the one hand, and for area metric geometry 
as a prototypical example of a non-metric geometry, where our general techniques come into full 
play, on the other hand. Of course, and this is one purpose of this article, the reader may instead 
study his own favourite candidate for a spacetime geometry or dispersion relation by the techniques 
developed in this paper. The results presented here apply universally. 



1. COVARIANT DISPERSION RELATIONS I: MASSLESS PARTICLES 

The dispersion relation for massless matter is determined by the entirety of matter field 
equations one stipulates. We conclude in this section that the dispersion relation is encoded in a 
cotangent bundle function P that induces a reduced homogeneous polynomial in every tangent space. 



To understand how a massless dispersion relation arises from matter field equations, we consider 
a (gauge fixed) action S[Q>,G] for a field multiplet = (& N ), where G is an a priori arbitrary 
tensor field encoding the geometry of a smooth manifold M on which the matter field dynamics 
are defined. Here we will notably not restrict attention to pseudo-Riemannian geometries, but 
rather allow a priori for any tensorial structure G to play the role of the geometry. Indeed, it is one 
of the points of this article to find how the choice of G is restricted if it is to provide a consistent 
classical spacetime structure. Matter field dynamics, however, are restricted from start to those 
giving rise to linear field equations, since only these can serve as test matter probing the geometry. 
More precisely, the manifestly local equations of motion then take the form 

= 0, (1) 

where small latin indices range from to dimM — 1, and the coefficient matrices Q at all orders 
depend only on the geometry G but not on the value of the fields & N ; note that this is really 
only true for genuinely linear field equations, and does not even hold for the linearization of 
fundamentally non-linear dynamics [67] • Remarkably, the theory of partial differential equations 
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reveals that the entire causal structure of the local field equations are encoded in the principal 
polynomial P associated with the equations ([!]). The latter is defined as the leading order term 
P{q) of the scalar function ojq det Dmn^Q) on covectors q, where ojq is some scalar density of 
appropriate weight, constructed from the tensor G such that P is a polynomial function in each 
cotangent fibre. We will see in the following sections that the choice of the scalar density log does 
not affect the dispersion relation for massless matter, but that it is relevant for the dispersion 
relation for massive matter. 

Concrete illustrations of this truly simple construction are given at the end of this section, where 
we determine the cotangent bundle function P for abelian gauge field dynamics on metric and area 
metric geometry as two prototypical examples, and then again in section [9] where we are able to 
show the non-physicality of other popular deformations of Maxwell theory, based on the results 
devoleped throughout this paper. The central observation of this section is that P is recognized to 
induce a homogeneous polynomial P x of some degree deg P in each cotanget space. 

The field equations ([!]) may be viewed in the geometric-optical limit whenever the leading order 
term of the above determinant in fact coincides with the determinant of the leading order operator 
of the field equations, 

P(x,q) =oj g det(Q h -^(x) qil ...q is ), (2) 

which indeed is guaranteed to be the case if ^ is not identically zero. This construction ^ is 
covariant in the first place because for fixed M, N, the leading order coefficients Q % li'^ a , and indeed 
only those, transform like a tensor, and so the determinant is a tensor density whose weight depends 
on the nature of the fields & N and must be countered by the weight of ujq. A concrete illustration 
of this construction is given at the end of this section, where we determine the cotangent bundle 
function P for abelian gauge field dynamics on metric and area metric geometry as two prototypical 
examples. As we will show now, the physical role of P is that it provides the massless dispersion 
relation that arises in the geometric optical limit of Q, in form of the solvability condition 

P(x,q) = 0. (3) 

This is seen by considering matter solutions in the short-wave approximation |40].|41].|42|.|43|. 
where one considers solutions of ([I]) taking the form of the formal series 

oo 

<^(x,A) = e^ J>f(x)A^' (4) 

3=0 
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and then obtains an approximate solution taking the limit A — > 0. In the above expansion, <j>^ (x) 
is a tuple of functions for each j, and the scalar function S(x) is known as the eikonal function. 
Substituting the formal series Q into the field equations ([!]), one finds 



■ S(x) 

e l a A~ s 



QMN(x) h - ia d h S-- ■ di s S<f>$(x) + Y,v Mj {x)\ j 

3=1 



0, (5) 

where each of the VMj( x ) terms depends on some of the matrix coefficients of the differential 
equation (flj), on coefficients 4>^ (x) of the expansion (jZJ) and on the eikonal function S and its 
derivatives of lower than the highest order s. For Q N (x) to be a solution after any trun- 
cation of the series ([5]), the latter has to vanish order by order in A. Clearly, the first term 
e 1 ^^ QMN(x) n ls dhS ■ ■ ■ di s S 4>q (x), corresponding to the power A _s , vanishes with non-trivial 
4>q only if the eikonal function S satisfies the differential equation 

P(x, dS) ~ det (Q(x) h - is d h S ■■■ d ia S) =0, (6) 



where P(x, dS) is recognized to be a homogeneous polynomial in dS. Equation ^ is known as 
the eikonal equation and represents the solvability condition for the first term in Considering 
the first term in ^ as an approximate solution for ([!]) is what is called the geometric optical 
limit and its further relevance, beyond the role it plays for us here, is that having this lower order 
approximate solution, one can generate higher order approximate solutions. More details about 
the short-wave approximation, albeit only for scalar fields or otherwise only first order equations, 
can be found in the books by Perlick |41j . Egorov and Shubin [ID] and the lecture notes by Rauch 
|32] . In [23] equations of the type ([T]) are discussed. 

In case P is a reducible polynomial in each fibre, i.e., a product P(x, q) = Pi(x, q) ai ■ ■ ■ Pf{x, q) a f 
of irreducible [68] factors P±, . . . ,Pf with positive integer exponents a\, . . . , a/, subtleties arise. In 
that case |40].|41] one has to take as the cotangent bundle function P the reduced polynomial 

P(x,q) = P 1 (x,q)---P f (x,q), (7) 

in other words, one must remove repeated factors in the original polynomial. We will have more 
to say about the relation between properties of a reduced polynomial and those of its individual 
factors in section [2j 

Clearly, this cancelation does not alter the set N x of massless momenta at a spacetime point x, 
which due to the homogeneity of P x constitutes the algebraic cones 

N x = {k€T*M\P(x,k) = 0}, (8) 
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cotangent space cotangent space cotangent space 

FIG. 1: Homogeneity of P in its cotangent fibre gives rise to a cone of P-null covectors in each cotangent 
space. Three prototypical examples of such cones are shown, the first one being the familiar Lorentzian metric 
cone. Only for the second example do we have N x ^ N* ulooth , with the difference set being constituted by 
the covectors lying in the intersection of the cones. 

which is to say that every positive real multiple of a massless momentum is again a massless 
momentum, cf. figure [I] For technical precision, we will occasionally focus on the smooth subcone 

^smooth = { k€Nx \ DP ( X> k)^0}, (9) 

where DP denotes the derivative of P with respect to the cotangent fibre. So the cotangent bundle 
function P determines the (smooth) massless momentum cone. The converse question, namely 
under which conditions the massless momentum cone N x at a point x determines the polynomial 
P x up to a constant factor, is subtle, but of central importance. The vanishing sets associated with 
polynomials are the subject of study of algebraic geometry, and we will indeed have opportunity to 
employ some elaborate theorems of real algebraic geometry. In the remainder of this first section, 
we clarify the relation between vanishing sets of real polynomials and the principal ideals that 
these polynomials generate, since this will be relevant later. Recall that an ideal / C R in a ring R 
(where R is here concretely the ring of real polynomials in dimM real variables) is a subset that 
is closed under addition and under multiplication with an arbitrary ring element. Now on the one 
hand, we may consider the situation where we are given an ideal / and define the vanishing set 
V(I) as the set of cotangent vectors that are common zeros to all polynomials in /. On the other 
hand, we may be given a subset S of cotangent space and consider the set I(S) of all polynomials 
in R that vanish on all members of that set S. Now it can be shown that I(S) is an ideal in the 
ring of polynomials on cotangent space, and that one always has the inclusion 

X(V(I))2I. (10) 



s 



The question under which conditions equality holds is studied in the Nullstellensatze of algebraic 
geometry. While this is a relatively straightforward question for polynomials over algebraically 
closed fields [44], such as the complex numbers, for the real numbers underlying our study here, 
one needs to employ a string of theorems that were originally developed in order to solve Hilbert's 
seventeenth problem. Indeed, for a reduced homogeneous polynomial P x , one obtains the equality 

I(N X ) = (P x ) , (11) 

if _/V| mooth ^ (which condition we will be able to drop for the hyperbolic polynomials to which 
we will narrow our attention from the next section onward, see the remarks following the First 
Lemma proven there). Here {P x ) denotes the ideal containing all polynomials that have P x as a 
factor. Drawing on the said results from real algebraic geometry, this is seen as follows. Let P x i 
be the ith irreducible factor of P x Then there exists a q S N smooth (P x j) so that corollary 2.9 of 
|45j shows that P x i generates a real ideal, i.e., Z{N(P x i)) = (P x i). According to corollary 2.8 of 
[4"5] . the reduced polynomial P x thus also generates a real ideal since it does not contain repeated 



factors. Finally theorem 4.5.1 of [36] yields the claim. The equality (11) will play a significant 
technical role in ensuring that we can determine the vector duals of massless momenta using 
elimination theory, in section [3] 



Illustration: Maxwell theory on metric and area metric backgrounds 

We now illustrate, by way of two concrete examples, how the cotangent bundle function P and 
thus the massless dispersion relation are extracted from a field theory on a given geometry. 

Our first example, Maxwell theory on a metric background, is of course a classic problem, and 
developed in full detail for instance in [41J. Here we present only those results which are illuminating 
with regards to the present work. So let (M,g) be a metric manifold of arbitrary signature, and 
consider a one-form field A whose dynamics is governed by its coupling to the metric tensor g 
according to the Maxwell action 

S[A,g] = -\f <PxyJ\ det(g)\g am g bn F mn F ab , (12) 

where F = dA is the field strength. Moreover, an orientable metric manifold carries a canonical 



volume form uj g . =| det(g) I 1 / 2 e a b c d, so that after variation of the action (12) with respect to 



the one-form field A, and after introducing coordinates x a = (t,x a ), one can rewrite the obtained 



9 



second order field equations for A as a system of first order field equations (plus two constraint 
equations) for the electric field E a = F(dt,d a ) and the magnetic field B a = ajq 1 (dt,dx a ,F) as 



A bM N d b + B M N ) u N = 0, (13) 

where u N = (E a ,B a ) and the matrices A bM n depend on the metric tensor g and the volume 
form uj g , see |41j for the exact dependence. Strictly speaking, choosing the field strengths as 
dynamical variables does away with the gauge symmetry not precisely by gauge-fixing the action, 
as was assumed in the general part of this section, but achieves the same at the level of the 
equations of motion and was chosen because it allows for a more concise discussion here. In any 
case, the system (13) is a particular case of 0, so that the massless dispersion relation is given 



by some scalar density P x (q) = det(A b qb) = 0. Inserting the explicit expression for A bM n, one 
finds P x (q) = <?o(ftr H*?) q)) 2 - To obtain the massless dispersion relation, we must cancel repeated 
factors in P x (q), as explained above. Moreover, one can show [32] that qo = is inconsistent with 
the constraint equations, so that we finally obtain the massless dispersion relation on a metric 
background as derived from Maxwell field equations as P x (q) = 9 x l {q-, q) = 0, the familiar result. 

As a second example, we discuss area metric geometry. We provide only the most basic defi- 
nitions and results needed for the purpose of this example; for a more detailed introduction, see 
e.g. [47j,|48j,[49j. An area metric manifold (M, G) is a smooth differentiable manifold M equipped 
with a smooth covariant fourth rank tensor field G with the algebraic symmetries G a b c d = Gcdab 
and G a bcd = Gbacd- Moreover, an area metric is required to be invertible in the sense that there is 
a smooth tensor G abcd so that G abpq G pqc d = 2{5^5 b d — S^S b ). In particular, an area metric G can be 
viewed as a metric Petrov (G) in the space of two forms and it induces the canonical volume form 
(uj g ) abed = (det (Petrov (G))) 1/6 tabcd f° r dim Ad — 4. Maxwell theory on an area metric background 
was fully studied in |50| . |49j and here we only summarize the main results. We consider a one-form 
field A coupled to an area metric background according to the action 

S[A, G] = ~\j d 4 x\ det (Petrov (G)) | 1/6 F ab F cd G abcd , (14) 



where F = dA is the field strength. After variation of the action (14) with respect to the one 



form field A and after introducing coordinates x a = (t, x a ) one can rewrite, as in the metric case, 



the obtained second order field equations for A as the system (13) of first order field equations 
(plus two constraint equations). For this case u N = (E a ,B a ) with electric field E a = F(dt, d a ) and 
magnetic field B a = w^ 1 (dt, dx a , F), and the matrices A b&1 n depend now on the area metric tensor 
G and the volume form uq. For the explicit dependence see [29]. Then the massless dispersion 
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relation must again be given by P x (q) = det(A b q^) = 0. After an explicit calculation one finds 

Px{q) = qo p x(q) with 

P x (q) = -i( WG J mnp5 ( UG J rstu Gr r(a G* s|c G* ta g ag6 g c g d . (15) 

One can also show, as in the metric case, that qo = is inconsistent with the constraint equations, 
such that we finally find that the massless dispersion relation on an area metric background as 
derived from Maxwell field equations is given by P x (q) = 0. This result has been obtained first by 
Hehl, Rubilar and Obukhov [5T] , [52j in the context of premetric electrodynamics. 

2. HYPERBOLICITY 

Employing our knowledge on how a cotangent bundle function P encoding the massless 
dispersion relation arises from field equations, we now identify the hyperbolicity of P as 
a crucial condition for the field equations to be predictive in the first place. For the cone of 
massless momenta, the real Hilbert Nullstellensatz is thus shown to hold without further conditions. 

The second property required for a cotangent bundle function P to provide a viable massless 
dispersion relation, besides being a reduced homogeneous polynomial in the fibre coordinate, also 
originates in the underlying matter field equations ([I]) and goes right to the heart of what classical 
physics is all about. Namely that the theory be predictive. In other words, initial data on a 
suitable initial data surface are required to evolve in a unique manner. Remarkably, the question 
of what constitutes suitable initial data surfaces on the one hand, and the question of whether the 
field equations evolve the initial data in a unique way on the other hand, are both decided by an 
algebraic property of the cotangent bundle function P. More precisely, an inescapable condition 
for the initial value problem to be well-posed in a region of spacetime is that P defines a hyperbolic 
polynomial P x at every point x of this region |53|.|54j. A homogeneous polynomial P x is called 
hyperbolic with respect to some covector h if for every covector q with P x (q) ^ any A solving 

P x (q + Xh)=0 (16) 

is a real number. This definition of hyperbolicity is easy to understand in geometric terms. It 
simply means that there is at least one covector h such that every affine line in cotangent space in 
the direction of h intersects the cone defined by ^ in precisely degP points, see figure [2j counting 
algebraic rather than geometric multiplicities. Any such covector h identifying P x as hyperbolic is 
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cotangent space 



cotangent space 



FIG. 2: Hyperbolicity cones for two prototypical polynomials. On the left the familiar second degree 
Lorentzian cone; on the right a fourth degree cone defined, for simplicity, by a product of two Lorentzian 
metrics. 



itself called a hyperbolic covector at the point x. The various connected sets of hyperbolic covectors 
in the same cotangent space, as for instance the upper (shaded) cones in figure [2j are called the 
hyperbolicity cones of P at x. It is often useful to take a more global point of view and consider 
a smooth distribution C of hyperbolicity cones C x over all spacetime points x, which one simply 
may think of as the cone of all smooth covector fields h for which h x G C x . More precisely, let h 
be a covector field hyperbolic with respect to P, that is h defines a hyperbolic covector at every 
spacetime point. Then the hyperbolicity cone C(P,h) containing h is constituted by all covector 
fields q with the property that all functions A on M satisfying 



P(x,q(x) - X(x)h(x)) = 



(17) 



are positive everywhere on M. The cone C(P, h) induces a cone C X (P, h) in each cotangent space 
T*M, consisting of the values q(x) of all q £ C(P, h) evaluated at x, which is called the hyperbolicity 
cone of P with respect to h at x. Clearly, C X (P, h) only depends on the value of h at x, and thus 
one may think of C(P,h) simply as the said distribution of the C x (P,h) over all x G M. The 



somewhat implicit definition of hyperbolicity cones, both the local (16) and the global one (17), 



can be cast into the explicit form of deg P polynomial inequalities, as was proved in [55J from the 
Routh-Hurwitz theorem, if at least one hyperbolic covector within the cone one desires to describe 
is known. More precisely, let P be hyperbolic with respect to h so that P(x,h(x)) > without 
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loss of generality. Then the hyperbolicity cone is described by the deg P inequalities 

det Hi{v , h) > for all i = 1, ... , deg P , 
where the matrices H i , i?2 > • • • > Hdeg p are constructed as 



(18) 



Hi(v,h) 



hi 


h 3 


h 5 . 


• ^2i-l 


ho 


h 2 
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• h2i-2 





hi 


h . 


■ t>,2i-3 
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h 2 . 


■ hvi-4 








. 


hi 



where hj is set to for j > i 



(19) 



from the coefficients of the expansion 

P{x,v + Xh) = h {x,v,h) X degP + hi(x,v,h) X^^ 1 + --- + h degP (x,v,h). (20) 

Similar to algebraic geometry dealing with algebraic sets defined by polynomial equations, there is 
a rather elaborate theory of semi-algebraic sets |56j.|46| defined by polynomial inequalities, of which 
the hyperbolicity cones are, according to the above theorem, a particular instance. Employing this 
theory will be of advantage in the proof of the first lemma, below. The remarkable properties of 
hyperbolicity cones, which underlie all further constructions, have been elucidated by Garding [57] 
a long time ago. Recalling that a subset C of a real vector space V is called a convex cone if 
besides any real multiple sof an element of C also the sum of any two elements of C lies again in C, 
Garding proved the following results. First, any covector field b belonging to a given hyperbolicity 
cone C(P,a) equally represents the hyperbolicity cone, C(P,b) = C(P,a). Second, C(P,a) is an 
open and convex cone. Third, P is strictly non-zero on C(P,a), but vanishes on the boundary 
dC(P,a). Finally, the suitable initial data surfaces, alluded to at the beginning of this section, are 
recognized as those whose normal covector fields are hyperbolic. 

The calculation of hyperbolicity cones is significantly simplified if the cotangent bundle function 
P is factorizable into factors of multiplicity one, as in eq. Q. Then P is hyperbolic with respect to 
h if and only if each of its individual factors is hyperbolic with respect to h. For such reducible P x , 
the determination of the hyperbolicity cone with respect to some hyperbolic covector h is reduced 
to the determination of the hyperbolicity cones of the individual factors, since 



C x (Px, h x ) — C x (Pi x , h x ) n • • • n C x (Pf x , h 2 



(21) 
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Thus it is not a coincidence that the hyperbolicity cone indicated in figure [2] is the intersection of 
the hyperbolicity cones of the two Lorentzian cones whose union constitutes the vanishing set of 
the underlying dispersion relation. We conclude this section with the proof of two key properties 
of hyperbolic polynomials, which we will use repeatedly throughout this paper. 

First Lemma. For a reduced homogeneous hyperbolic polynomial P x , the set jVJ mooth is a dense 
subset of the cone N x of massless momenta. 



Proof. Since our variety N is generated from a single polynomial P, i.e., (P) = I(N), it follows 
from Definition 3.3.4 of [46J that the set of sing ular points is Sing(iV) = JV \ N smooth . But then 
dimSing(A^) < dimiV = dimM — 1, where the inequality is Proposition 3.3.14 of [46J and the 
equality follows from the hyperbolicity of P [58] . Thus we know that the singular set is at most of 
dimension dimM — 2. Further, we know from the first remark in 3.4.7 of |56j that Sing(iV), being a 
real algebraic set, can be expressed as a finite union of analytic semialgebraic manifolds S{ and that 
every such manifold has a finite number of connected components. From the propositions 2.8.5 and 
2.8.14 of [46j we thus obtain that dimSing(iV) = max(dim(S'j)) = max(d(<Sj)), where d(Ai) is the 
topological dimension of the semialgebraic submanifold Si C T*M. Since dimSing(iV) < dimM — 2 
we conclude that Sing(iV) consists of only finitely many submanifolds of M. n of topological dimen- 
sion less or equal to dimM — 2. Thus its complement AP 311100 * 11 = N \ Sing(iV) is dense in N. 



This property of hyperbolic polynomials is also mentioned in |58| . As an important corollary 



we obtain that the real Nullstellensatz (11) holds for any reduced hyperbolic polynomial without 
further conditions; for if P is hyperbolic, certainly N x is a non-empty set of codimension one, so 
that the dense subset jV| mooth must be non-empty. 



Second Lemma. If P x is a reduced homogeneous hyperbolic polynomial with hyperbolicity cone 
C x at some point x G M then for all covectors s G T*M\closure(C x ) there exists a massless 
covector r on the boundary dC x of the hyperbolicity cone such that s(DP x (r)) < 0. 



Proof. It is clear that if y G C x and s closure(C x ), the line y + As intersects the boundary 
dC x at some ro = y + Xqs for some positive Ao- Thus P x (ro) = and, since P X (C X ) > 0, we 
have P x (ro — es) > for sufficiently small positive e. Now we must distinguish two cases: First 
assume that P x (tq + es) < 0, from which it follows that iPx(fo + es) | e =o = s(DP x (ro)) < 0, 
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which proves the lemma with r := ro; Second, assume that P x (ro + es) > which is equivalent to 
^P x (ro + e«)|e=o = s(DP x (ro)) = which in turn holds if and only if DP x (ro) = (to see the 
latter equivalence assume that, to the contrary, s(DP x (ro)) = and DP x {tq) / 0; this implies 
that s must be tangential to dC x at ro, but since y lies in C x and C x is a convex cone y + As 
could then not intersect dC x at ro, which we however assumed). So to prove the lemma in this 
second case, we need to construct another r' Q G dC x that satisfies the condition s(DP x (r' Q )) < 0. 
Now since the First Lemma guarantees that the set iVJ mooth , on which DP X is non-zero, lies 
dense in N x , we can find in every open neighborhood U around ro a vector r' £ dC x such that 
DP x (r' Q ) / 0. We define z := r' — ro and y' := y + z. Since C x is an open cone, y lies in if we 
choose the neighborhood small enough, and the line y' + As intersects dC x at r' Q . Finally since 
r' G 9C X we know that -PrlVo) = and P x (r' Q — es) > 0. We conclude that s(DP x (r' )) < 0. This 
proves the second lemma with r := r' . 



So far, our considerations have focused on the geometry that is impressed by the massless 
dispersion relation on each cotangent space. In this section, we now associate vector duals in 
tangent space with the massless momenta defined by the dispersion relation. A central role is 
played by a dual polynomial P* on tangent space that is associated with the polynomial P x on 
the corresponding cotangent space. Physically, the dual polynomial emerges as the tangent space 
geometry seen by massless point particles. 

In order to associate velocity vectors with massless particle momenta in physically meaningful 
fashion, we employ the dynamics of free massless point particles. Their dynamics, in turn, are 
uniquely determined by the dispersion relation, because the Helmholtz action 



describes particles that are free due to the form of the first term and massless because of the 
Lagrange multiplier term. In the following, we wish to eliminate the momentum q and the Lagrange 
multiplier A to obtain an equivalent action in terms of the particle trajectory x only. Variation 
of the Helmholtz action with respect to A of course enforces the null condition for the particle 
momentum. Now variation with respect to q yields x = XDP x (q) for all q G iV smooth , which implies 



3. 



VECTOR DUALS OF MASSLESS MOMENTA: GAUSS MAP 




(22) 
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cotangent space tangent space 

FIG. 3: Gauss map sending the zero locus of a polynomial to the zero locus of a dual polynomial 
the weaker equation 

\DPM] = [ j , (23) 
where [X] denotes the projective equivalence class of all vectors collinear with the vector X. 



In order to solve (23) for q, we need the inverse of the projective map [DP]. We will now derive 
that this inverse is given by the gradient of a so-called dual polynomial. Indeed, the image NT of 
the massless covector cone N x under the gradient map DP is again described by a homogeneous 
polynomial Px , albeit of generically different degree than P. More precisely, for an irreducible 
cotangent bundle function P, we look for a likewise irreducible tangent bundle function P* that is 
uniquely determined up to a real constant factor at each point x of the manifold by the equation 

P*(DP x (N» mooth )) = . (24) 

The polynomials P x and Pf given by P and P* at any given point x of the base manifold are 
then called dual to each other, and it is convenient to also call the corresponding cotangent bundle 
function P and tangent bundle function P# dual to each other. For a cotangent bundle function 
P that is reducible into irreducible factors 

P(x,k) = Pi(x,k)---P f (x,k), (25) 

we define the dual tangent bundle function as the product 

P*(x,v) =Pf(x,v)---Pf{x,v), (26) 



where the P 4 are the irreducible duals of the irreducible Pj determined by equation (24). Thus 
P# is uniquely determined up to a real factor function on M and satisfies again equation ( 24 ) , as 
one easily sees from application of the product rule. 
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The proof for the existence of a dual P x , and indeed its algorithmic computability for any 
reduced hyperbolic polynomials P, is provided by a branch of algebraic geometry known as elimi- 
nation theory. This is where the real Nullstellensatz, discussed at the end of section [T] and shown to 
hold for any reduced hyperbolic polynomial at the end of the previous section, becomes essential. 
For if the real Nullstellensatz holds, Proposition 11.10 of |44| asserts that the polynomial conditions 
which a vector X must satisfy, in order for there to be a solution to the problem of having the 
polynomials 

P x (k), X^-DP x (k) l \ X** - DP x (kyo (27) 

all vanish for some k, are obtained by first calculating an elimination ideal. How this is done in 
practice, by using Buchberger's algorithm and Grobner bases, is explained most lucidly in . The 
so calculated elimination ideal, however, may turn out to be generated by several real homogeneous 
polynomials. However, making use of the fact that we are dealing with real polynomials, it is easy 
to construct the dual as a sum of appropriate even powers of the generating polynomials, 
which obviously vanishes where and only where all generators vanish. It should be said that while 
for most polynomials of interest, a direct calculation of dual polynomials using elimination theory 
exhausts the capability of current computer algebra systems , in many cases one is nevertheless able 
to guess the dual polynomial by physical reasoning (as we will illustrate for the cases of metric and 
area metric geometry at the end of this section). Once such an educated guess has been obtained, 



one may directly use the defining equation (24) to verify that one has found the dual polynomial. 
In any case, since its existence is guranteed, we will simply assume in the following that a dual P& 
has been found by some method. 

Equipped with the notion of the dual polynomial, we may now return to the projective gradient 
map 

[DP X ] : [N*™° th ] [Nf] , [q] ^ [DP x {q)\ (28) 



first encountered in (23), where the brackets denote projective equivalence classes, identifying 
parallel vectors (respectively covectors), but not antiparallel ones, and Nf is the image of _/V| mooth 
under DP X . The projective map [DP X ] is well-defined due to the homogeneity of P x , and will be 
refered to as the Gauss map. The problem of inverting the Gauss map is now solved by definition 
of the dual Gauss map [DP X ] in terms of the dual polynomial P x , 

[DP*] : [N# smooth ] -+ [N x ] , [X] ^ [DP#{X)\ , (29) 
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since we then have for null covectors k € N^ mooth that 



[DP#]([DP x ]([k])) = [k] if det(DDP x )(k) ? 0. 



(30) 



so that the dual Gauss map [DP*] acts as the inverse of the Gauss map on the images of all 
covectors k satisfying the above determinantal non-degeneracy condition. That relation ( 30 ) holds 
is most easily seen from rewriting the duality condition (24) in the form 



P*(x, DP{x, k)) = Q{k)P{k) for all covectors k , 



(31) 



since this form does not require an explicit restriction to null covectors. Thus differentiation with 
respect to k yields, by application of the chain rule and then of Euler's theorem [69J on the right 



hand side, for any null covector k satisfying the non-degeneracy condition in (30) that 



DP#(x,DP(x,k)) = Q{X p k) k, 
dee P — 1 



(32) 



which in projective language takes the form (30). In particular, we may thus solve the projective 



equation (23) for 



[q] = [DP*)([x/\]). 



(33) 



Obviously, the homogeneity of DP* in conjunction with the projection brackets allows to disregard 
the function A altogether. However, another undetermined function fj, appears when translating 
this result back to non-projective language, 



DP* I 



(34) 



Now we may replace the momentum in (22) by this expression and use again Euler's theorem 



applied to the homogeneous polynomial P x to finally obtain the massless point particle action 



h [ X M= j dT»P*{ X , X ) 



(35) 



Relations (23) and (33) reveal the physical meaning of the Gauss map [-D-Pr] and its inverse [DP X ]: 



up to some irrelevant conformal factor, they associate null particle momenta in _/vj mooth with the 
associated null particle velocities in N* smooth . The automatic appearance of a final Lagrange 



multiplier [i in (35) also hardly comes as a surprise, since it is needed to enforce the null constraint 



Pf{x) = 0. This reveals the direct physical relevance of the dual tangent bundle function P* as 
the tangent space geometry seen by massless particles. 
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Illustration: Dual polynomials for metric and area metric geometry 

As an illustration of the above abstract theory, we provide the explicit form of the dual tangent 
bundle functions P# associated with the cotangent bundle functions induced by abelian gauge 
theory on first metric and then area metric geometry. 

For a metric manifold (M, g), we saw that the cotangent bundle function was given by P g (x, q) = 
g x 1 {q,q)- It is eas y to guess its dual, namely P g (x,v) = g(v,v). Indeed, Pf x {DP gx (q)) = 



Ag x (g x (g, -),g x (q, •)) = ^g x = 4:P gx (q), so that equation (24) is satisfied. Thus we conclude 

jj 

that P gx (x, v) = g x (v, v) is the dual polynomial of P gx (q). 

The case of an area metric manifold (M, G), where the cotangent bundle function Pq induced 



by abelian gauge theory is given by (15), is already cosiderably more complicated. At first sight it 
could seem that there is no way to avoid the use of elimination theory. However, already in four 
dimensions, elimination theory is prohibitively difficult for current computer algebra programs, even 
if full use is made of our knowledge of normal forms for area metrics |49j . So while in principle 
Bucherberger's algorithm applies, practically one is better off obtaining an educated guess for what 



the dual polynomial might be, and then verifying that guess employing equation (24). Thanks to 
the invertibility properties of area metrics, an educated guess for the dual of Pq can be derived 
directly from Maxwell theory [50] . In the language of |50| . a wave covector field q for an abelian 
gauge field field strength F is a section of T*M satisfying 

g A F = 0, q A H = , (36) 

where the constitutive relation between the field strength F and the induction H is given as 



H ah = -l/MdetG\ 1 / 6 oj Gabmn G mn P cl F pq . Solving (36) leads to the Fresnel polynomial (15). Dually, 



and this is the key idea, ray vector fields v on the discontinuity surface determined by a wave 
covector field q are defined as 

F(v,-) = 0, H(v,-) = 0, (37) 



where F and H are solutions of (36). Looking for solutions of the system (37), one finds that the 



condition for their existence is that the ray vector fields v must satisfy the polynomial equation 

P *» = -^(^I)™(^) rS * n G, m nKaG a;b | ps | c G xd)gt ^^^ c ^ = , (38) 

which for physical reasons should present precisely the dual tangent bundle function associated 
with Pq. Indeed, using the algebraic classification of area metrics [39], it is then a simple exercise 
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to verify that for metaclasses I-XI and XIII-XIX, the cotangent bundle function Pq defined in 



(38) satisfies at every point the defining property of the dual polynomial (24). But as we will 
see in the illustrations at the end of the next section, area metrics of metaclasses VIII-XXIII can 
never give rise to viable dispersion relations. Anticipating that result, we recognize Pq as the dual 
polynomial to Pq for all viable area metric spacetime geometries. 



4. BI- HYPERBOLIC AND ENERGY-DISTINGUISHING DISPERSION RELATIONS 

In this section, bi-hyperbolicity (meaning that both P and P& are hyperbolic) and the energy- 
distinguishing property are introduced as further conditions on the geometry. These further 
conditions are needed in order to provide an unambiguous notion of observers and positive energy. 

We saw that the dual polynomials defined by the tangent bundle function P& in each tangent 
space play an essential role . In fact, one needs to restrict attention to dispersion relations that 
are bi-hyperbolic, meaning that both P and its dual P# are hyperbolic. This is because only then 
may one select one hyperbolicity cone of (which is defined, mutatis mutandis, precisely 
as the hyperbolicity cones of P in section [2]) and stipulate that it contain the tangent vectors to 
admissible observers at a spacetime point. We will prove a non-trivial consistency result concerning 
the stability of the so defined observers in section [sj Physically, a choice of corresponds to a 
choice of time orientation of the manifold. The point is that having chosen the observer cone C* 
in the tangent bundle, one can immediately see that those momenta p at a point x whose energy 
is positive from every observer's point of view, constitute a convex cone 

(Cf) 1 = {pe T*M | p(v) > for all v G C*} . (39) 

For the Lorentzian metric case, this is simply (the closure of) what has been chosen as the forward 
cone, while in general the situation is more complicated; see the illustration at the end of this 
section. If the polynomial P is of the product form ([7]), we find that the positive energy cone is 
simply the sum of the positive energy cones coming from the duals of the factors of P [59] , i.e. 

{C #) ± = (C*) ± + --- + (C*) ± , (40) 

where the sum of two convex sets is just the set of all sums of any two elements of the two sets. 

That hyperbolicity does not imply bi-hyperbolicity is illustrated by the counterexample in figure 
|4j Bi-hyperbolicity indeed presents a rather stringent condition on dispersion relations, and thus 
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cotangent space tangent space 

FIG. 4: Example of a hyperbolic polynomial with non-hyperbolic dual polynomial. 

on the underlying geometry. This is illustrated at the end of this section, first for the case of metric 
geometry, where bi-hyperbolicity amounts to the requirement that the metric be Lorentzian, and 
second for the case of area metric geometry, where a similar exclusion of algebraic classes follows 
directly from bi-hyperbolicity. A similar study may and needs to be conducted for the reader's 
favourite candidate for a spacetime geometry. In the following we focus on some conclusions that 
can be drawn independent of any particular geometry and which will be important for our further 
theoretical developments. 

Having guaranteed an observer independent notion of positive energy, the only thing left is to 
ensure that any massless momentum q has either positive or negative energy. More precisely, we 
require the set N of massless non-zero covector fields to disjunctively decompose into positive and 
negative energy parts 

N = N + UN~, (41) 

where N + is defined as the intersection of N with the positive energy cone (C*) -1 -, and N~ 
as the intersection with the negative energy cone (— C*) -1- . We will refer to such bi-hyperbolic 
cotangent bundle functions P as energy-distinguishing. Figure [5] shows the vanishing sets of an 
energy-distinguishing bi-hyperbolic polynomial P x and its dual Px- 

For dispersion relations that are both bi-hyperbolic and energy-distinguishing, we find that the 
set of massless momenta N x cannot contain any null planes in spacetime dimensions d > 3, which 
in turn implies that the degree of P cannot be odd. This will be of importance later, and is seen 
as follows: First, we prove that bi-hyperbolicity of P x implies that 

closure(C# ± ) n -closure(C# ± ) = {0} . (42) 

Let ko be such that fco G closure(C*" L ) and ko G — closure(C*" L ). It follows from the definition 
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cotangent space tangent space 

FIG. 5: An energy-distinguishing bi- hyperbolic dispersion relation. 

of the dual cone that the following inequalities are true for all x G Cf : x.ko > and x.ko < 0. 
If this would be true the hyperbolicity cone Cf had to be a plane or a subset of a plane. That 
would contradict the property of Cf to be open. Second, suppose that the zero set N x contains a 
plane. From closure(C*" L ) n — closure(C*" L ) = {0} it follows that Cf 1 ' \ {0} is a proper subset of 
a halfspace. A proper subset of a halfspace cannot contain any complete plane through the origin. 
Hence the existence of a null plane of P x would obstruct the energy-distinguishing property. 
Third, this fact immediately restricts us to cotangent bundle functions P of even degree. For 
suppose deg-P was odd. Then on the one hand, we would have an odd number of null sheets. On 
the other hand, the homogeneity of P implies that null sheets in a contangent space come in pairs, 
of which one partner is the point reflection of the other. Together this implies that we would have 
at least one null hyperplane. 

At this point in this article, we have arrived at the insight that a physical dispersion relation 
for massless point particles must be given by a cotangent bundle function P that is a bi-hyperbolic 
and energy-distinguishing reduced hyperbolic homogeneous polynomial in each fibre. These are 
now all the conditions on P we identify in this work. The following two sections serve to show that 
the theory extends to massive dispersion relations and allows for all kinematical constructions one 
needs to provide a physical interpretation of quantities on the spacetime manifold. The final two 
sections are then devoted to further embellish the theory, in particular to make contact to non- 
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covariant representations of dispersion relations and to derive a generic mechanism for a covariant 
energy cut-off on non- metric spacetimes. 

Illustration: Exclusion of algebraic classes of metrics and area metrics 

The polynomial defining the massless dispersion relation on a metric background g is given as 
Pgx(o) = 9 x 1 (q^q)i as seen 111 section [TJ It is easy to verify that P g is bi-hyperbolic if and only 
if g is a metric of Lorentzian signature and that its dual is given as P gx {v) = g x (v,v). Moreover, 
hyperbolicity of P g also implies the energy-distinguishing property. For from the explicit definition 
of C# we know that at every point x and for every vector X G the covector g x {X, •) G (Cjf )-*-. 

44- 44- _ 

Arranging for P gx (C x ) > and knowing that P g is hyperbolic, it is easy to show that g x [0J, v) > 

4+4+ 4+ 

for every vector uj G dC x and v G C x , which shows that for every vector u G dC x the covector 
g x (ui, •) G {Cf) 1 - and g x (-oo, •) G -(C#) ± . More precisely g x {dd , ■) G (C^)^ and g x {-dd , •) G 
— (C*) -1 ". But g x {dCf , •) G (C*) -1 " is the image of the dual Gauss map induced from P gx ^ when 
applied to dC x . Thus, we conclude that Lorentzian metric geometry is a bi-hyperbolic geometry 
of the energy-distinguishing type. 

For the area metric case, bi-hyperbolicity serves to exclude the algebraic area metric metaclasses 
VIII to XXIII. That result was implicitly obtained in |49j . specifically it is contained in Lemma 4.1 
of that work, which asserts that there exists a plane of massless covectors for any four-dimensional 
area metric manifold belonging to metaclasses VIII to XXIII. But since the existence of a null 
plane does not allow for a bi-hyperbolic and energy-distinguishing P, the area metric algebraic 
metaclasses VIII to XXIII must be discarded as viable spacetime geometries. 

5. COVARIANT DISPERSION RELATIONS II: MASSIVE PARTICLES 

In this section, we extend the theory to massive dispersion relations and define the set of 
positive energy massive particles. Bi-hyperbolicity and the energy- distinguishing property, orginally 
introduced to guarantee a well-defined duality theory between massless covectors and massless 
vectors, are shown to also play a crucial role when discussing massive matter. In particular, they 
ensure reverse triangle and inverse Cauchy-Schwarz inequalities. 

For a bi-hyperbolic and energy-distinguishing dispersion relation, there is always a hyperbolicity 
cone in cotangent space that is of positive energy with respect to a chosen time orientation . 
For let C x be some hyperbolicity cone of P x , whose boundary dC x we know to be a connected set of 
null covectors. Now on the one hand, the complete zero set of P x is contained in (C X ^)~ L U —(C x ^) ± 
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due to the energy-distinguishing property. On the other hand, we have that (42) holds. Hence 
either C x or — C x is of positive energy. 

The covector fields in the thus selected positive energy cone C play two related roles. The first 
role, from the point of view of the field theory, was that a hypersurface can only be an initial data 
surface if its normal covector field lies in C. Now in order to identify the second role of the cone 
C in relation to massive matter, first observe that within the hyperbolicity cone C, the sign of P 
cannot change, so that we may arrange for P to be positive on C without upsetting any of the 
constructions made so far. We will assume from now on without further comment that this choice 
has been made. But then we have for any momentum q in C x at a spacetime point x that 



Px(q) = m de s p 



(43) 



for some positive real number m > 0, which we call the mass associated with the momentum 
q. It must be emphasized that the definition of mass associated to a momentum, as provided by 
(43), hinges on the choice of a particular volume density ujq i n P])- Physically this is understood 
from the need to convert mass densities in field theory into point masses in particle theory, which 



conversion requires a definition of volume. But then (43) represents a massive dispersion relation 



whose mass shells foliate the interior of C x , see figure [6j An immediate physical consequence of 
the convexity of the cone C x is that even for modified dispersion relations, a decay of a positive 
energy massless particle into positive energy massive particles is kinematically forbidden. 




P(q) = rn 




P(q) = nT 



cotangent space 



cotangent space 



FIG. 6: Mass shells defined by bi-hyperbolic energy-distinguishing cotangent bundle functions P. On the 
left the familiar second degree Lorentzian case; on the right a fourth degree case defined by a product of 
two Lorentzian metrics. 



At this point we derive a further important consequence of bi-hyperbolicity and the energy- 
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distinguishing property, namely that together they imply completeness of the polynomials P x 
defined by P in each cotangent space; in the terminology of [57], a hyperbolic polynomial P x is 
called complete if the lineality space 

L(P) = {a e T*M | for all y G T* X M and A G M : P(y + Xa) = P(y)} (44) 

only contains the zero covector. In other words, in order to be complete, P must depend on all 
covector components in any chosen basis. Geometrically, completeness can be read off from the 
closure of the hyperbolicity cones since [60] it is equivalent to 

closure(C(P x , h)) n closure(C(P :c , -h)) = {0} . (45) 

That completeness is already implied by the energy-distinguishing condition can be easily seen 
from this. For picking up the argument given at the start of this section, we know that 

closure(C#- L ) n - closure {C* 1 ) D closure (C^) n -closure(C x ) . (46) 

Thus if the right hand side differs from {0} (meaning that P is incomplete), the left hand side 
will contain non-zero covectors, too (showing that P is not energy-distinguishing). Because of the 
inclusion, this only holds in this direction. We conclude that the energy-distinguishing property 
already implies completeness. 

There are three principal reasons why it is so important that completeness holds. First, com- 
pleteness will play a crucial role in ensuring, as we will see in the next section, that there is a 
well-defined duality theory associating massive covectors with their vector counterparts. Thus 
remarkably, bi-hyperbolicity and the energy-distinguishing property, originally conceived in the 
context of massless dispersion relations, also take care of this in the massive case, via complete- 
ness. Second, since we arranged for P x to be positive everywhere on C x for the massive dispersion 
relation to make sense, we have the reverse triangle inequality 

P^ desP (h + k 2 ) > P 1/dcsP (h) + P^ dc z p {k 2 ) (47) 

for all k± and k 2 in the same hyperbolicity cone C x . Equality holds if and only if k\ and k 2 
are all proportional. Physically, the reverse triangle inequality generalizes a familiar result from 
Lorentzian geometry to any viable dispersion relation in our sense, namely that the decay of a 
massive particle generically gives rise to a mass defect. Third, defining the tensor 

degP /dimV „ \ 

p*(ki,...,k degP )= w ^ n ^{.kj^jPM, (48) 
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as the totally symmetric polarization of the polynomial P x , we can also formulate a reverse Cauchy- 
Schwarz inequality 

P x (k u ...,k degP ) >P x (ki) 1/degP ---P x (kd cg p) 1/degP (49) 

for all k\ , . . . , kdeg p in the same hyperbolicity cone C x . Similar to the reverse triangle inequality 
above, equality holds for the reverse Cauchy-Schwarz inequalities if and only if all arguments ki 
are proportional to each other. 



6. VECTOR DUALS OF MASSIVE MOMENTA: LEGENDRE MAP 

For a bi-hyperbolic and energy- distinguishing P, we introduce the action for free massive point 
particles. In contrast to the polynomial tangent space geometry P* seen by massless particles, the 
dual geometry seen by massive particles turns out to be encoded in a generically non-polynomial 
tangent bundle function P* . This largely obstructs attempts to devise a non-trivial Finslerian 
or Lagrangian tangent bundle geometry that describes massive and massless point particles 
simultaneously. 



We wish to associate vector duals with massive momenta (having done so for massless momenta 
in chapter [3]) , and to this end we employ the Helmholtz action 

I[x,q,X]= I dr \q a ± a - XmlnPix, — )1 , (50) 
J L ml 

which describes particles that are free due to the form of the first term, and massive since the 
massive dispersion relation P(x,q) = m dcgP is enforced through variation with respect to A. The 
particular form of the Lagrange multiplier term here has been chosen for the technical reason of 
having available the theory of Legendre duals on the open convex cones C x , see [59J . More precisely, 
the so-called barrier function, 

f x :C x ^R, f x (q) = lnP 2 (g) , (51) 

which we employed in the massive particle action above, is firstly guaranteed to be strictly convex, 
i.e., for each A G [0, 1] we have f x ((l — X)v + Xw) < (1 — X)f x (v) + Xf x (w) for all v,w in the 
hyperbolicity cone C x , due to the completeness of P |60j . which in turn is guaranteed by the 
energy-distinguishing property, as we saw in the previous chapter; secondly, near the boundary of 
the convex set, it behaves such that for all q G C x and b £ dC x 

lim (D q _ b f x )(b + X(q-b)) = 0, (52) 

A-s>0+ 
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which property is known as essential smoothness in convex analysis. The important point is that 
strict convexity and essential smoothness together ensure that the barrier function f x induces an 
invertible Legendre map 



L x '■ C x — > L X (C X 



q^-(Df x )(q) 



(53) 



and a Legendre dual function 



f£ : L X {C X ) -»■ M, g(v) = -L x \v)v - f x (L~ l (v)) 



(54) 



which can be shown, ultimately by virtue of the above conditions, to be an again strictly convex 
and essentially smooth function on the open convex set L X (C X ). Note that the two minus signs in 



(54) are correct, and due to our sign conventions. In fact, the inverse Legendre map is the Legendre 
map of the Legendre dual function f L : 



Dfx = L-\v) + DL-\v)v + DL-\v)Df x {L-\v)) = L~» 



(55) 



In other words, the Legendre dual of the Legendre dual (L X (C X ), f x ) of (C x , f x ) is again (C x , f x ), 
see theorem 26.5 of |59|. 




cotangent space 



Legendre map 




tangent space 



FIG. 7: Mass shell and Legendre map of massive momenta to tangent space. 



The existence of this Legendre theory now enables us to eliminate the q and A degrees of 
freedom, in order to obtain an equivalent particle action I\x\ in terms of the particle trajectory x. 
In the process, we will identify the definition of proper time that renders the law of free particle 



motion simple. Variation of the action (50) with respect to q yields x = (XdegP)L x (q/m), which 
we know may be inverted to yield 



q = mL x (x/(X deg P)) . 



(56) 



27 



It is now obvious why it was convenient to encode the dispersion relation by a Lagrange multi- 



plier term involving the barrier function (51); while many other ways to enforce the very same 
dispersion relation of course do exist, the latter allows to make use of the above theory of Legendre 
transformations in a straightforward manner. Using the thus obtained relation and the definitions 
of the barrier function and the Legendre dual to eliminate q, one obtains the equivalent action 

I[x,X] = -mdegP J dr Xf L (±/(\degP)) = -mdegP J dr [\f£(x) + Aln(AdegP)] , (57) 

where for the second equality we used the easily verified scaling property f L (ax) = f L {x) — In a. 



From variation of the action (57) with respect to A, we then learn that 

f L (x) + ln( A deg P) + 1 = . (58) 

Using this twice, we have Xf x (x) + Aln(AdegP)) = —A = — exp(— f x (x) — 1)/ degP. Noting that 
because of x E L X {C X ) we also have L~ 1 (x, x)(x) = 1 and thus f x (x) = —1 — f x {L~ 1 {x)), and 
defining the tangent bundle function 

P* : L X (C X ) -> R , P* x (v) = P x {L-\v))- 1 , (59) 



we eliminate A in (57) and finally arrive at the equivalent action 

I[x\ = m J dTP"(x,x) 1/dcsF (60) 

for a free point particle of positive mass m. While the tangent bundle function P* is generically 

non-polynomial, it is elementary to see that it is homogeneous of degree degP, and for later 
reference we also display the useful relation 

x 1 DP*(x,v) 

L * {V) - degP P*(x,v) ■ (61) 



The action (60) is reparametrization invariant, as it should be. However, parametrizations for 



which P(x, L x (x, x)) = 1 along the curve are distinguished since they yield the simple relation 

x = L x (q/m) (62) 

between the free massive particle velocity x and the particle momentum q everywhere along the 
trajectory x. As usual, we choose such clocks and call the time they show proper time. Thus we 
have established the physical meaning of the Legendre map, and may thus justifiedly call the open 
convex cone L X {C X ) the cone of massive particle velocities, and the function P* the massive dual 
of P, which indeed encodes the tangent bundle geometry seen by massive particles. 
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Reassuringly, we can now prove that the observer cone lies in the massive dual, C* C L X (C X ). 
Thus one may think of observers as massive, as usual. The converse, however, does not hold, since 
the inclusion is generically proper. Since this statement, in slightly refined form, will be of central 
importance again in chapter [8j we will formulate it by way of two lemmas: 

Third Lemma. For any reduced hyperbolic homogeneous cotangent bundle function P we have 
L X (C X ) = interior(C^). 

Proof. Since by assumption P x is reduced, hyperbolic and homogeneous, we get from the 
First and the Second Lemma in chapter [2] the statement: for all p S T*M \ closure(C x ) there 
exists an r £ dC x such that p.DP x (r) < 0. Since p.DP x {q) is a continuous function of q, we 
conclude that for all p G T*M \ closured^) there exists an q £ C x such that p.DP x (q) < 0. That 
implies that the set L x (C :r )- L is a subset of closure(C x ) \ {0}. Since L X (C X ) is convex, we get 
L X (C X ) 5 (closure(Ca;) \ {O}) -1 = interior (C x ). Furthermore, we know that L X (C X ) C C x . Since 
L X (C X ) is open it follows that L X (C X ) = interior(C^). 

Fourth Lemma. For any bi-hyperbolic and energy-distinguishing cotangent bundle function P, 
we have Cf C interior(C^-). 

Proof. From chapter [5] we know that there exists a hyperbolicity cone C x of P x that lies 
completely in (Cf)-^. From (C^) 1 - 5 C x and the fact that Cf is open, we conclude that 
Cf C interior^). 

Comparing these results with those of chapter |3j we see that there is a fundamental difference 
between the ways in which null covectors on the one hand, and massive covectors on the other 
hand, are mapped to the respective velocities on tangent space. In the null case, the Gauss maps 
[Di^] and [DP X ] associate massless particle momenta with the respective null velocities, up to an 
undetermined real factor. In the massive case, in contrast, the Legendre map L x and its inverse L^ 1 
afford the same for massive particle momenta and velocities. As a consequence, the dual geometries 
seen on the tangent bundle by massless and massive particles differ. For the former, the Gauss dual 
P# is the relevant structure, and for the latter the Legendre dual P*. We wish to emphasize again 
that while is polynomial in its fibre argument, P* generically is not. Indeed, to explicitly find 
the inverse Legendre map L , and thus P* , can be very hard in concrete applications, although 
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its existence and uniqueness are guaranteed. Also in this sense, the tangent bundle geometry 
(TM,Pft,P*) is considerably less straightforward than the cotangent bundle geometry (T*M,P) 
it dualizes. This explains to some extent the difficulties noticed by Skakala and Visser in [ST].|62j 
to identify a single Finsler-type tangent bundle geometry: generically there simply is no such 
geometry on tangent space that could give rise, dually, to a bi-hyperbolic energy-distinguishing 
dispersion relation. The case of a Lorentzian geometry presents one notable exception. 

On the positive side, on the cotangent bundle, any bi-hyperbolic and energy-distinguishing 
reduced homogeneously polynomial geometry provided by P provides a perfectly fine spacetime 
geometry as far as point particle theory is concerned. And if one wishes to consider the coupling of 
fields, one needs to couple to an underlying tensorial geometry G that gives rise, by the very same 
field equations, to the cotangent bundle function P at hand, as discussed in chapter [T] Again, 
the metric case is degenerate, since there one does not recognize the difference between the four 
different roles played by the metric: the inverse metric plays the role of the fundamental spacetime 
structure to which fields couple, as well as the role of defining the (structurally very different) 
cotangent bundle function P, while the metric plays the role of both the dual as well as 
the tangent bundle function P* , which define the tangent space geometries seen by massless and 
massive particles, respectively. All these different structures are, strictly conceptually speaking, of 
course already at play in the familiar metric case, but display their different nature explicitly only 
in the general case. 

7. E(p) FORM OF DISPERSION RELATIONS AND INERTIAL LABORATORIES 

The decidedly covariant discussion of dispersion relations, which we adopted in this paper, is 
required even if one ultimately prefers to represent the dispersion relation in form of a function 
E{p), which expresses the energy of a particle in terms of its spatial momentum. This is because 
the therefore needed temporal- spatial split of a covariant particle momentum itself is governed 
by the covariant version of the dispersion relation. Pushing the theory of observers and frames 
further, we identify a generically non-linear parallel transport induced by a bi-hyperbolic and 
energy- distinguishing dispersion relation and thus succeed in defining inertia! laboratories. 

Converting our covariant dispersion relations for massive and massless matter into non-covariant 
dispersion relations is conceptually and mathematically straightforward. For from the point of view 
of an observer carrying a clock that shows proper time (who is thus formally described by a P*- 
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unit vector eo in the observer cone C#, see the discussion following equation (|62|), any spacetime 
momentum p can be uniquely decomposed as 



p = EL~ 1 (e )+p., 



(63) 



namely into an energy E and a purely spatial momentum p satisfying p(eo) = 0. For a vi- 
sualization, see the corresponding tangent space split in figure |8j Employing such a particular 
observer-dependent split, one may solve the covariant dispersion relation 



P(x,EL-\e )+p) =m degP 



(64) 



for the energy E in terms of the spatial momentum p, and thus obtain an observer-dependent, non- 
covariant dispersion relation E = E{p). Keeping in mind that the latter depends in two ways on 



the cotangent bundle function P, namely indirectly through the temporal-spatial split (63) imposed 



by it and directly through the dispersion relation (64), this non-covariant version can be useful 



since it more directly relates to measurable quantities. However, due to Galois theory, we know 
that the energy will not even be an analytic expression in terms of the spatial momentum unless 
degP < 4, and not polynomial in any case. The crucial properties of bi-hyperbolicity and the 
energy-distinguishing property are even more hidden in the non-covariant formulation. This is of 
course the key reason for having dealt exclusively with a strictly covariant treatment of dispersion 
relations for all formal developments throughout this article. 



ann L (e ) 





ann L (e ) 



tangent space 



tangent space 



FIG. 8: Purely spatial directions with respect to eo are those annihilated by L 1 (eo). For the Lorentzian 
metric case on the left, this is coincides with the space of vectors g-orthogonal to ep. 



Conversely, the conversion of a non-covariant dispersion relation into a covariant one will be 
prohibitively difficult in most cases. This is essentially due to the fact that given a relation E = 
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E(p), the construction of a spacetime momentum p from the E and p, and indeed their physical 
meaning, is not directly possible without the cotangent bundle function P. We feel that this is 
often not considered where modified dispersion relations are proposed. Sometimes recourse to an 
'anyway' underlying spacetime metric is made, but it is hard to see how this would be consistent 
with the stipulation of a modified dispersion relation, due to the above double role played by the 
cotangent bundle function P. 

In the above discussion, it was sufficient to identify the purely spatial momentum associated 
with the unit timelike direction of a particular observer worldline. It is often useful to go further 
and to consider freely falling non-rotating observer frames. This is needed, for instance, if one 
wishes to determine the electric and magnetic field strengths seen by such an observer for a given 
electromagnetic field strength two-form F. But the definition of non-rotating frames requires to 
establish a meaningful parallel transport, and we will now see how the latter arises from a general 
dispersion relation. Since we saw in seection [6] that observers are necessarily massive, their free 
motion is governed by an action functional 



S[x} = j ' dTP*{x,±) l ' de z p , (65) 

which we know to represent the trajectories of point particles of non-zero mass. Using the 
reparametrization invariance to set P*(x,x) = 1 along the curve, it is straightforward to derive 
the equations of motion and, using in the following standard techniques of Finsler geometry |63| . 
to cast them into the form 



x a + T a (x,x) = (66) 



with the geodesic spray coefficients 



„\ _ 1 am ( ^9{x,v) mc 99(x,v) bm 99(^v)bc \ h c (f- 7 \ 



These in turn are constructed from the tangent space metrics g eo defined by 

1 d 2 P*(x,e + su + tv) 2 / de z p 



9(x,e y 



2 dsdt 



(68) 



s=t=0 



whose inverses appearing in the expression (67) are guaranteed to exist from the completeness of 



the cotangent bundle function P. Indeed, for eo = L(e°) 1 an explicit expression for the metric (68) 
in terms of f L is given by 

9(x,e )a b = Pf /de9P (e ) {-(DDrt(e )U + 2L~ l ^L' 1 b {e )) , (69) 
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and for its inverse in terms of / by 

<e0) = P x 2/de9P (e°) (-(DDf x (e°)) ab + 2L x a (e°)L x b (e )) , (70) 



where (DD f x (e°)) ab (DD f£ (L(e°)))b c = 5%. Remarkably, the Finsler metric (|69j) is automatically 



Lorentzian. We show this as follows. Consider a cotangent frame e a with e a (L(e )) = for all 



a = dimM — 1, then from expression (70) it follows that 



9lW a4, = P x 2/de9P (e°) > 0, (71) 

= 0. (72) 

But since any covector p on the spatial hyperplane defined by L x (e°) can be written as p = p a £ a , 
we have 

1&&P*<ZPP4 = -P 2 x /de9P (e Q )(DDf x (e a )T b p a ey^ < 0, (73) 
where the last inequality follows from the positive definiteness of the Hessian of / (see theorem 



4.2 and remark 4.3 of [60]). Thus we conclude that the metric (70) and hence its inverse (69) are 
Lorentzian. 



The metric (69) and its inverse (70) will be seen to provide a normalization for local frames 



which is preserved along free observer worldlines. The form of equation (66) indeed suggests to 
identify a parallel transport on the manifold M which, on the one hand, allows to recast the 
geodesic equation in the form of an autoparallel equation, and on the other hand, provides us with 
the means to define parallel transport also for purely spatial vectors. To this end, it is known to 
be convenient to define the derivative operators 

ft^-r^A, where ri iM:= ^M, (74) 

since now one can define, in full formal analogy to the Levi-Civita connection in metric geometry, 
the Chern-Rund connection coefficients 

^jk(u, v) = -x9\x,v) {5j9( x ,v)sk + dk9( x ,v)js - ^s9( x ,v)ih) ■ ( 75 ) 

These transform, due to the use of the 5i operators, precisely as a linear connection would under 
a change of coordinates x = x(x). It is then straightforward to see that for any vector w G L(C) 
and vector field u on M, one may define a new vector field with components 

{V w uY = w a d a u i + T{x, wy jk w j u k . (76) 
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Clearly, V w acts as a derivation on vector fiels, namely V TO (u + v) = S/ W u + V w v and V w {fu) = 
(wf)u + fV w u for any function / and vector fields u, v. Thus V w may be consistently extended 
to act on arbitrary tensor fields S, T on M by imposing the Leibniz rule 

V W (S®T) = (V W S)®T + S®(V W T) (77) 

for arbitrary tensor fields T and S. The derivation is not linear in its directional argument w, 
though, and thus amounts to what is often called a non-linear connection in the literature. Nev- 
ertheless, the non- linear covariant derivative V achieves the desired reformulation of the geodesic 



equation ( 66 ) as the autoparallel equation 

Vi x = . (78) 

The non-linear connection V provides sufficient structure for the discussion of freely falling 
non-rotating frames. The key technical observation is that for a frame field eo, ■ • • ,&d-i that is 
parallely transported along the first frame vector eo, 

V eo e a = , (79) 

we have the conservation equation 

eo (Se (e a ,e 6 )) = 0. (80) 
This means in particular that any normalization imposed on spacetime frames by virtue of the 



metric (69) is preserved along the worldline of a freely falling observer. In turn, (79) establishes a 



consistent notion of freely falling and non-rotating observer frames, and thus inertial laboratories. 



8. HIGH-ENERGY SIGNATURE OF MODIFIED DISPERSION RELATIONS 

The high- energy behaviour of massive matter changes not only quantitatively, but also 
qualitatively, if the standard relativistic dispersion relation is modified. In particular, there is 
a maximum energy which a massive particle can have without radiating off, sooner or later, 
a massless particle. More precisely, one finds that massive matter cannot not radiate off 
massless particles if and only if observers can ride on it. For the standard relativistic disper- 
sion relation, this is the familiar result that such radiation does not take place at all. For a 
modified dispersion relation, this reveals a covariant mechanism for an effective high energy cut-off. 
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Consider a process where a positive energy massive particle of momentum p radiates off a 
positive energy massless particle. We will now show that due to energy-momentum conservation, 
such a process is kinematically forbidden if and only if p lies in the stability cone 



(81) 



which in turn always lies entirely within the cone C x of massive momenta with positive energy. 
For the proof of these assertions, see further below; for an illustration, see figure [9j 

Specializing to the familiar case of the standard dispersion relation on a Lorentzian manifold, 
one of course obtains that L~^{Cf') = C x ; in other words, there is no Cerenkov radiation in vacuo. 
But for a modified dispersion relation where L~ 1 (Cj t ) C C x , there is a clear covariant mechanism 
for a dynamic energy cut-off: if a massive particle is made so energetic that its momentum 
leaves L~ 1 (C;jf), it may sooner or later radiate off a massless particle, or several ones, until its 
momentum lies within the stability cone. Reassuringly, observers' energies are obviously always 
below the energy cut-off. 



stable energy cone 




inverse Legendre map 



cotangent space 



observers 




tangent space 



FIG. 9: Stability cone: if and only if an observer can ride on a particle, the particle cannot lose energy by 
a vacuum Cerenkov process 



Now we turn to the proof of the assertion that the stability cone (81) contains precisely the 



momenta of those massive particles that cannot radiate off a massless particle in vacuo. To this 
end we will need to employ all four lemmas previously proven in this article: First of all we get 
from the Third and Fourth Lemma that every observer corresponds to a massive momentum, 
C L X (C X ) = interior(C^), so that L~ 1 [C^') is well defined and always lies within C x . It is now 
easy to see that a massive particle of mass m and positive energy momentum p may only radiate 
off a positive energy massless particle if there exists a positive energy massless momentum r £ iV+ 
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such that r(L x (p)) > 0. For consider the function 



u(A) :=-lnP x 



p — Xr 



m 



) 



(82) 



Since for any positive A, the covector — Xr G — (C x ) lies in some half-space of the cotangent 
bundle, while p G C x C (Cf) 1 - lies in the corresponding other half, we conclude that for some 
Ao > 1 the line p — Xr will necessarily intersect the boundary of C x , so that lim^-i-Ao u(Xq) = +oo. 
Further, from theorem 4.2 and remark 4.3 of [60], we know that for a complete hyperbolic P x 
the Hessian of the barrier function — lnP x is positive definite. Hence, we find that u"(X) > 
everywhere on its domain. Now first assume that the massive particle of momentum p decays into 
a massive particle of the same mass and of momentum p — r and a massless particle of momentum 
r, thus respecting energy-momentum conservation. Then we have from the equality of masses for 
the ingoing and outgoing massive particles that u(0) = u(l) = 0. But because u"(X) > 0, the only 
way for the analytic function u to take the same finite values at A = and A = 1 while tending 
to +oo at some Ao > 1 is to have > u'(0) = —r(L x (p)). Conversely, assume that r(L x (p)) > 
for some r G N x . Then u'(0) < and we conclude by the mean value theorem that there must 
be a (because of u"(X) > unique) Ai with < Ai < Ao such that u(Xi) = 0, i.e., there is an 
outgoing particle of the same mass such that the process occurs. In summary, a massive particle 
of momentum p can radiate off a positive energy massless particle if and only if there exists an 
r G AT+ such that r(L x {p)) > 0. 

Now on the one hand, we have that p L~ l (Cf) if r(L x (p)) < for some r 6 N£. For 
then r lies certainly in (C*) -1 ", and thus r(L x (p)) > for all p £ L x (Cx). On the other hand, 
if p £ L x l {Cf), we have r(L x (p)) < for some r G N x . This one sees essentially from the 
fact that Cf is a hyperbolicity cone of -Pjf since then for every p L~ 1 (Cj t ) there exists some 
v on the boundary of C x such that DP X (v)(L x (p)) < 0, as is shown in the Second Lemma in 
section [2J Clearly, the image DPf{y) of v under the Gauss map DPf is then a massless covector, 
and it remains to be shown that it lies inside the positive energy cone C x . Since in an energy- 
distinguishing spacetime, a null covector is either of positive or of negative energy, it suffices to 
find a single y G C x with y(DP x (v)) > in order to show that DP x (v) lies indeed in the positive 
energy cone (Cx) . But this is easily established from the convexity of Cx . For then we certainly 
find some y G Cf such that y + v G C* ■ But then y(DPf(v)) = dP*(v + sy)/ds\ s=0 > 0. In 
summary, p G L x l (Cf) if and only if there exists an r G N x with r(L x (p)) < 0. 
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9. NON-PHYSICALITY OF GAMBINI-PULLIN AND MYERS-POSPELOV 

DISPERSION RELATIONS 

The developments of this paper are far from academic musings of only remote relevance to 
physics. Indeed, the identification of bi-hyperbolicity and the energy distingusihing conditions as 
inevitable properties of dispersion relations provides, once known, a simple algebraic check on the 
physical consistency of any given dispersion relation. How simple indeed it is to apply these condi- 
tions has already been shown when we derived that for certain classes of area metric geometries, the 
general linear electrodynamics formulated on such backgrounds satisfy the physicality conditions. 

In this section we now show that it is equally simple to extract from our results that some rather 
popular modifications of electrodynamics, namely those of Gambini-Pullin and Myers-Pospelov, 
indeed possess dispersion relations that reveal the underlying field theory to be non-predictive. In 
the case of Myers-Pospelov, hyperbolicity (and thus predictivity) can be restored, but unfortunately 
only at the expense of destroying the energy-distinguishing conditions (and thus a well-defined 
notion of positive energy). These theories thus do not have the physical interpretation that would 
be required in order to render observational investigation of bounds on their parameters meaningful. 
It is obvious that it is both necessary, and above all simple, to subject also any other proposal for 
modified dispersion relations to the same straightforward tests, and that not to do so is simply 
negligent from a physical point of view. 

Gambini-Pullin field equations. Gambini and Pullin [S] obtained a modified dispersion re- 
lation by studying the interaction Hamiltonian for electromagnetic and gravitational fields in a 
semi-classical approximation motivated by loop quantum gravity. More precisely, they found the 
following refined equations for the electromagnetic field 

V x B-d t E + aV 2 (V x B) = (83) 

V x E + d t B + aV 2 (V xi) = 0, 



with a being a length scale. In fact, it is easy to see that equations (83) are not well-posed. For if 



one defines u A = (E,B), equations (83) become 

D AB (d)u B = 0, (84) 



with Dab{9) a matrix- valued differential operator explicitely given by 



D AB {d) 



Sikdt e ijkdj + a tijkdididj 

a e ijk dj + e ijk dididj 5 ik d t 



(85) 
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where in the above expression is the standard Levi-Civita symbol and Einstein's summation 
convention is used. The polynomial det D(iq) is easily found to be 

P(q) = det(D(iq)) = -q 2 (q 2 + 2a(q ■ q) 2 - a 2 (q- q) 3 ) 2 , (86) 

with p 2 = Pq — p ■ p, so that the its principal part 

P(q ,q) = aY (q-q) 6 , (87) 

which is not hyperbolic. Even arguing that only lower order of a should be considered, the very 
same problem remains. Thus, the Gambini-Pullin field equations are not predictive, and the 
corresponding dispersion relation non-physical. It has been argued [64J that inadequate quantum 



states were considered by Gambini and Pullin in the obtainment of equations (83). Urrutia et. al. 
|65j performed a re-examination of Gambini-Pullin calculations, with a more careful motivation for 
the quantum states considered. However, these result in only slightly different refined equations 
(neglecting a non-linear term in the magnetic field) for the electromagnetic field, and a very similar 
analysis as above also shows that again the associated dispersion relation is not hyperbolic. 

Myers-Pospelov field equations. Mayers and Pospelov studied dimension 5 operators [11] leading 
to cubic modified dispersion relations. Specifically, they proposed the following modified equations 
for the electromagnetic field 

Dftd)A v = 0, (88) 

with D^(d) a matrix- valued differential operator explicitely written as 

Dftd) = U5 v a ^ v n a {n ■ dfd^ (89) 

where in the above expression, 7 is the free parameter of the theory, 77 is the standard Lorentzian 
metric 77 = diag(l, — 1, — 1, — 1), and n is a time-like covector with respect to 77, i.e. r/(n, n) > 0, 



which breakes Lorentz invariance. For the operator (89) one finds 

det(D(iq)) = (q 2 ) 2 [{q 2 ) 2 - 7 2 (n • qf (n a n? - rfV) q a q p ] . (90) 

From the above expression we read off the principal part 

P(q) = l 2 (p 2 ?{n-pf (n a n? - ^n 2 ) p aPp , (91) 

which for r](n,n) > is not hyperbolic. This is so, because the matrix n a n 13 — r] a/3 n 2 , under 
the assumption 77(71, n) > 0, is positive semi-definite, which implies that one of the factors of the 
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principal part of P(p), namely [n a n^ — 77°^ n 2 ) p a pp is not hyperbolic. Hence, the Myers- Pospelov 
field equations are non-predictive. Furthermore, even if one were to choose n such that rj(n, n) < 0, 
one would still have a null-plane due to the term n-p, which we saw at the end of section[4]to obstruct 
the energy distinguishing property, and thus to lead to a non-physical dispersion relation. Precisely 
the same argument also rules out the scalar and fermionic modified field equations presented in 

In conclusion, the field equations found by Gambini, Pullin and Urrutia in the framewok of loop 
quantum gravity, as well as the field equations found by Myers and Pospelov in the framework of 
effective field theory do not lead to physical dispersion relations. More precisely, there is no 
spacetime hypersurface £ on which initial data for the electromagnetic field could be given so 
that its values on a later hypersurface would be uniquely prescribed. Hence, phenomenological 
conclusions, such as the identification of bounds, based on these modified dispersion relations are 
unfortunately not conclusive. 

10. CONCLUSIONS 

Our investigation of viable dispersion relations in classical physics yielded far more than the 
identification of the three restrictive conditions we aimed at. Indeed, while deriving that a dis- 
persion relation must be encoded in a cotangent bundle function P that (i) defines a reduced 
homogeneous polynomial in each cotangent space (ii) is hyperbolic and has a dual tangent bundle 
function that is also hyperbolic and (iii) is energy-distinguishing, we collected a number of further 
important results along the way. 

First of all, the algebraic properties to be satisfied by a dispersion relation immediately restrict 
also the possible fundamental spacetime geometries from which the dispersion relation derives. 
While the connection between the dispersion relation (encoded in a cotangent bundle function 
P) and the fundamental spacetime geometry to which fields couple (encoded in some tensor G) 
played an explicit role only in sections [T] and [2j all the restrictions we derived in this paper for 
the cotangent bundle function P automatically translate into restrictions on the geometry G. 
Making this transparent has been one purpose of the illustrations throughout, where we saw how 
a fundamental geometry given by a symmetric second rank tensor, for instance, is automatically 
restricted to be a Lorentzian geometry (due to the requirement of bi-hyperbolicity of the induced 
dispersion relation), and then automatically energy-distinguishing. Thus we recovered, from first 
principles, the familiar result that Lorentzian manifolds provide a consistent fundamental spacetime 
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structure. The crucial point, of course, is that the very same principles and resulting conditions 
apply for any other tensorial geometry. We illustrated that while these physicality conditions do 
hold for certain classes of area metric geometry, they do not for two very popular proposals for 
field theories with modified dispersion relations. 

Second, we obtained a complete theory of observers and point particles. One remarkable ob- 
servation here was that massless and massive particles are governed by very different geometries 
on the tangent bundle. While massive particles are governed by a generically non-polynomial 
Finsler function P*, it is a in each fibre homogeneously polynomial tangent bundle function 
that governs the motion of massless particles. But both tangent space structures derive from the 
same cotangent bundle structure given by P. The distinction is generated due to fundamentally 
different duality maps between cotangent and tangent spaces, depending on whether one is dealing 
with massive or massless momenta. While in the former case the duality is given by a Legendre 
map, it is a projective Gauss map in the latter case. In both cases, the duality maps are generically 
non-linear, which replaces the ubiquitous linear algebra in metric geometry by the need for some 
elementary convex analysis and real algebraic geometry in the general case. 

Third, we found that even for modified dispersion relations, key properties of the Lorentzian 
kinematics still hold. One result is that observers can always be thought of as massive. Another 
one that the decay of a massless particle into massive ones is kinematically impossible. A third 
kinematical issue, namely the kinematic exclusion of processes where a massive particle radiates 
off energy in terms of massless particles, only holds below a certain energy threshold. The energy 
threshold, in turn, is encoded in a subcone of the positive energy massive momentum cone and thus 
presents a fully covariant notion. We find that while observers automatically respect that energy 
threshold, there are massive particle momenta exceeding it. But then a Cerenkov-type process, 
by which the massive particle radiates off energy in terms of a massless particle, is kinematically 
possible even in vacuo. Proving these assertions, which amount to a covariant geometric mechanism 
for an energy cut-off, required use of the entire machinery we developed for general dispersion 
relations. Of course, one cannot make any statement about the average decay time without studying 
the quantum theory of matter on such generalized geometries. 

Fourth, we saw that a non-covariant representation of dispersion relations, where the energy of 
a point particle seen by a particular observer is given as a function of its spatial momentum, is not 
even meaningful all by itself. The trouble is that the decomposition of a spacetime momentum into 
its energy and purely spatial momentum requires a temporal-spatial split, which in turn depends 
on the particular observer, by way of the cotangent bundle function P. This means that the 
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kinematical objects appearing in a non-covariant dispersion relation can only be given meaning 
by first obtaining the covariant formulation. In practice therefore, it is hard to start with a non- 
covariant formulation. Even worse, there may well be no covariant formulation at all from which 
a given non-covariant relation could be derived, in which case the latter is revealed to be in fact 
meaningless. It is therefore sensible, both conceptually and physically, to start from the covariant 
formulation, as we have done throughout this article. Even more so, since the crucial algebraic 
properties we identified for the cotangent bundle function are deeply hidden in a non-covariant 
formulation. 

Fifth, even the notion of freely falling and non-rotating frames hinges on the covariant dispersion 
relation at hand, and is provided by a generically non-linear connection on the tangent bundle. 
Thus the dispersion relation is also seen to have an effect on the interpretation of spacetime 
quantities (such as for instance the electromagnetic field strength two-form) in terms of quantities 
that are actually measurable in some laboratory and related to the covariant quantity through 
an observer frame (such as electric and magnetic fields). It is both remarkable, and important to 
realize, that the choice of a dispersion relation has such far-reaching implications. At the same 
time, our algebraic restrictions on the cotangent bundle function P, originally required for other 
reasons, single-handedly ensure that all required kinematical notions exist can be constructed. 

Sixth, the problem of finding a pseudo-Finslerian analogue of Lorentzian geometry is solved. 
The twist required by our findings is that the geometry is established by a single function P 
on the cotangent bundle, rather than the tangent bundle. Moreover, one can and must restrict 
attention to functions which are bi-hyperbolic and energy-distinguishing reduced homogeneous 
polynomials in each cotangent fibre. While we saw that all of these properties, and their interplay, 
are important for the geometry to provide a viable spacetime structure, it is bi-hyperbolicity 
in particular which generalizes the Lorentzian character of metrics to the much more general 
geometries studied in this work. Indeed, the same physical principle that selects Lorentzian 
signature amongst all possible metric geometries, namely the conditions of having well-posed 
matter field equations and a well-defined notion of positive particle energy, directly led to the 
above general conditions that cover also non-metric geometries. So while on the cotangent bundle, 
there is one single geometric structure over which we have excellent mathematical control, there 
are two very different structures induced by it on the tangent bundle, the duals P* and P#. 
The intuition behind many studies concerning a Finslerian generalization of general relativity has 
been correct in allowing for a very generic (non-polynomial) structure P* on the tangent spaces 
in order to describe massive point particles. However, we now know that attempts to describe 
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massless point particles with the same structure must fail in general: The massless particle motion 
governed by the polynomial structure will generically not coincide with the non-polynomial 
P* governing massive particle dynamics. So not even the most general Finslerian or Lagrangian 
geometry on the tangent bundle could possibly serve as a viable spacetime geometry in any other 
than the most special circumstances (such as the geometry actually being metric). Starting from 
the cotangent bundle, instead, one has the here developed theory available. 

With the results obtained in this work, we are now well equipped to address two key questions, 
which arise whenever a modified dispersion relation (or, equivalently, a modified fundamental 
spacetime geometry) is considered. 

The first issue is that one needs well-posed dynamical equations that can replace the Einstein 
equations. Remarkably, this apparently physical question is in fact a predominantly mathematical 
one: Dynamics for a particular spacetime geometry can be obtained from studying minimal repre- 
sentations of the deformation algebra of hypersurfaces in that very same geometry, so that the field 
equations are well-posed by construction. The geometric key ingredients are to have available, first, 
a Legendre map associating the normal covectors of suitable initial data surfaces with their vector 
duals, and second, a notion of proper time that provides a physically distinguished normalization 
of those normal vectors. For the Lorentzian case, this program has been carried out in seminal 
work by Hojman, Kuchaf and Teitelboim [66], and leads to the Einstein- Hilbert action with an 
undetermined cosmological constant as the unique such dynamics. The geometries identified in the 
present work are just the ones where these tools are indeed still available and thus the correspond- 
ing hypersurface deformation algebras can be derived straightforwardly. Finding respresentations 
in terms of the geometric variables directly leads to the corresponding non-metric gravity theories 
whose dynamics are well-posed by construction. In other words, with the tools developed in this 
paper, the physical art of constructing an alternative gravity theory for a given geometric structure 
has been reduced to a well-defined mathematical question in representation theory. 

On the other hand, one needs to understand the quantum theory of particles and fields. 
Equipped with the technical machinery we developed, one may now indeed start from the massive 
or massless point particle action and perform a first quantization, thereby obtaining field equations. 
Subtleties, which one can gloss over in the case of a Lorentzian background geometry without harm, 
now need to be taken into account. For instance, the restriction that momenta must lie within 
the hyperbolicity cone for massive particles, or on the cone of null covectors for massless particles, 
becomes of crucial technical importance in the non-metric case. In particular, this has important 
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repercussions for a second quantization of the field equations one obtained from the first quan- 
tization. Again, as in the question of identifying appropriate gravitational dynamics, the actual 
execution of the quantization may present, depending on the chosen geometry, a hard, albeit now 
well-defined, mathematical problem. 
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